Multi-Path Feedback Recurrent Neural Network for Scene Parsing
Authors
Abstract
In this paper, we consider the scene parsing problem. We propose a novel Multi-Path Feedback recurrent neural network (MPF-RNN) that enhances the capability of RNNs to model long-range context information at multiple levels and to better distinguish pixels that are easily confused in pixel-wise classification. In contrast to CNNs without feedback and RNNs with only a single feedback path, MPF-RNN propagates the contextual features learned at top layers through weighted recurrent connections to multiple bottom layers, helping them learn better features with such “hindsight”. In addition, we propose a new training strategy that considers the loss accumulated over multiple recurrent steps, which improves the performance of MPF-RNN on parsing small objects and stabilizes the training procedure. We empirically demonstrate that such an architecture with multiple feedback paths can effectively enhance the capability of deep neural networks in classifying pixels that are hard to distinguish without higher-level context information. With these two novel components, MPF-RNN provides new state-of-the-art results on four challenging scene parsing benchmarks: SiftFlow, Barcelona, CamVid and Stanford Background.
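The two ideas in the abstract, feedback from the top layer to several lower layers and a loss accumulated over recurrent steps, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the dense per-pixel layers (standing in for convolutions), the layer sizes, the names `init_params`, `mpf_forward`, `R1` and `R2`, and the unweighted sum of per-step losses are all assumptions made for illustration.

```python
import numpy as np


def relu(x):
    return np.maximum(x, 0.0)


def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy over pixels, computed in a numerically stable way."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(labels.shape[0]), labels].mean()


def init_params(rng, in_dim=8, hidden=16, n_classes=4, scale=0.1):
    """Hypothetical toy parameters; all sizes are illustrative assumptions."""
    return {
        "W1": rng.normal(0.0, scale, (in_dim, hidden)),     # bottom layer
        "W2": rng.normal(0.0, scale, (hidden, hidden)),     # middle layer
        "W3": rng.normal(0.0, scale, (hidden, hidden)),     # top layer
        "Wc": rng.normal(0.0, scale, (hidden, n_classes)),  # pixel classifier
        "R1": rng.normal(0.0, scale, (hidden, hidden)),     # feedback: top -> layer 1
        "R2": rng.normal(0.0, scale, (hidden, hidden)),     # feedback: top -> layer 2
    }


def mpf_forward(params, x, labels, n_steps=3):
    """Unroll the toy network for n_steps recurrent steps.

    x has shape (n_pixels, in_dim); labels has shape (n_pixels,).
    Returns the final logits, the per-step losses, and their sum.
    """
    n_pixels = x.shape[0]
    hidden = params["W3"].shape[1]
    top = np.zeros((n_pixels, hidden))  # no top-level context at the first step
    step_losses = []
    for _ in range(n_steps):
        # Top-layer features from the previous step are fed back to
        # several lower layers through weighted recurrent connections.
        h1 = relu(x @ params["W1"] + top @ params["R1"])
        h2 = relu(h1 @ params["W2"] + top @ params["R2"])
        top = relu(h2 @ params["W3"])
        logits = top @ params["Wc"]
        step_losses.append(softmax_cross_entropy(logits, labels))
    # Accumulate the loss over all recurrent steps (unweighted sum, an assumption).
    total_loss = sum(step_losses)
    return logits, step_losses, total_loss
```

Calling `mpf_forward(init_params(np.random.default_rng(0)), x, labels)` with `x` of shape `(n_pixels, 8)` and integer `labels` in `[0, 4)` returns one loss per recurrent step plus their sum. Training on the summed loss, rather than only the last step's loss, is one simple way to read "loss accumulated at multiple recurrent steps".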
Similar resources
Multi-Path Feedback Recurrent Neural Networks for Scene Parsing
In this paper, we consider the scene parsing problem and propose a novel Multi-Path Feedback recurrent neural network (MPF-RNN) for parsing scene images. MPF-RNN can enhance the capability of RNNs in modeling long-range context information at multiple levels and better distinguish pixels that are easy to confuse. Different from feedforward CNNs and RNNs with only single feedback, MPF-RNN propagat...
Hierarchical Feature For Scene Parsing Using Fully Recurrent Network
In scene parsing, wide-range contextual information is not effectively encoded. Scene parsing segments a scene into different regions associated with semantic categories. The main objective of scene parsing is to reduce the semantic gap between humans and machines in scene understanding. Applications of scene parsing include object detection, text detection o...
Geometric Scene Parsing with Hierarchical LSTM
This paper addresses the problem of geometric scene parsing, i.e. simultaneously labeling geometric surfaces (e.g. sky, ground and vertical plane) and determining the interaction relations (e.g. layering, supporting, siding and affinity) between main regions. This problem is more challenging than the traditional semantic scene labeling, as recovering geometric structures necessarily requires th...
Designing Path for Robot Arm Extensions Series with the Aim of Avoiding Obstruction with Recurring Neural Network
In this paper, a recurrent neural network is used for path planning in the joint space of a robot with an obstacle in its workspace. To design the neural network, a performance index is first defined as the sum of squared tracking errors of the end effector. Then, an obstacle avoidance scheme is presented based on the obstacle's spatial coordinates and the minimum distance between the obstacle and ea...
A vector matrix real time backpropagation algorithm for recurrent neural networks that approximate multi-valued periodic functions
Unlike feedforward neural networks (FFNN), which can act as universal function approximators, recursive, or recurrent, neural networks can act as universal approximators for multi-valued functions. In this paper, a real time recursive backpropagation (RTR...